Advancing forensic-based investigation incorporating slime mould search for gene selection of high-dimensional genetic data

Modern medicine has produced large genetic datasets of high dimensions through advanced gene sequencing technology, and processing these data is of great significance for clinical decision-making. Gene selection (GS) is an important data preprocessing technique that aims to select a subset of feature information to improve performance and reduce data dimensionality. This study proposes an improved wrapper GS method based on forensic-based investigation (FBI). The method introduces the search mechanism of the slime mould algorithm in the FBI to improve the original FBI; the newly proposed algorithm is named SMA_FBI; then GS is performed by converting the continuous optimizer to a binary version of the optimizer through a transfer function. In order to verify the superiority of SMA_FBI, experiments are first executed on the 30-function test set of CEC2017 and compared with 10 original algorithms and 10 state-of-the-art algorithms. The experimental results show that SMA_FBI is better than other algorithms in terms of finding the optimal solution, convergence speed, and robustness. In addition, BSMA_FBI (binary version of SMA_FBI) is compared with 8 binary algorithms on 18 high-dimensional genetic data from the UCI repository. The results indicate that BSMA_FBI is able to obtain high classification accuracy with fewer features selected in GS applications. Therefore, SMA_FBI is considered an optimization tool with great potential for dealing with global optimization problems, and its binary version, BSMA_FBI, can be used for GS tasks.

the examination of three distinct project scheduling challenges.Tolba et al. 51 presented a robust, improved forensic-based investigation (mFBI) optimization method for calculating the most efficient location of distributed generators (DGs) in electricity distribution networks (EDNs) to minimize the loss of power, as well as voltage deviations.Furthermore, hierarchical analysis is employed to derive the most relevant weighting factors for the multi-objective function (MOF).The efficacy of the proposed mFBI technique is validated and demonstrated through an investigation into the impact of DG integration on 118 IEEE EDN nodes and real Delta-Egypt EDN nodes.Chou et al. 52 suggested a forensic-based multi-objective investigation method for the multi-objective engineering optimization problem.Within this algorithm, the population undergoes initialization via chaotic mapping.Subsequently, Lévy flights, two elite groups, and a fixed-size file are employed to regulate the activities of investigators and police officers during offender search and frisking procedures.Simultaneously, a control time mechanism is integrated into MOFBI to harmonize exploration and exploitation, thereby attaining a Paretooptimal solution within the multi-objective search space.Experiments show that MOFBI can approximate the Pareto-optimal frontier more accurately than other algorithms.
Although many advanced and improved FBIs have been proposed, most of the existing improved algorithms still suffer from the question of slow convergence and the greater probability of falling into local optimal when solving some specific cases.In the original FBI algorithm, two phases are included in the process of criminal investigation: one is the investigation phase, and the other is the tracing phase.The two phases perform independent searches in their respective populations, and trapping into the same local optimal is possible.The search agent generated by the SMA can adaptively go beyond the local optimal and better find the optimal solution through the positive and negative feedback mechanism.This article is based on the original FBI version, and the search phase of the slime mould algorithm has been added to assist with the solution.During the proposed algorithm, the slime mould search mechanism is integrated as an independent inspector group that compensates for the shortcomings of the investigation and pursuit groups.The slime mould search mechanism dynamically adjusts the search patterns according to the probability of the suspect being at the location.When the probability is high, the slime mould search mechanism uses an area-restricted search methodology, which focuses on the identified area.If the probability of the suspect being at the location is initially found to be low, the slime mould search mechanism controls the search to jump out of the current area and look for other locations with a high probability of the suspect being there.Thus, this strategy can significantly increase the convergence speed of the algorithm and the capability of skipping local optimal.A new variant of the FBI, called SMA_FBI, is developed by incorporating the SMA strategy into the original FBI.After that, the binary version of the algorithm, i.e., BSMA_FBI, is obtained utilizing a conversion function, which is applied to the GS problem with high-dimensional data.
The remainder of this paper is divided into two sections: "Overview (FBI and SMA)" section describes the original FBI and SMA.In "Proposed SMA_FBI" section, SMA_FBI and the algorithm's time complexity are given a detailed description."Experiments" section gives and analyzes the experimental results."Discussion" section discusses the experiments as well as the results."Conclusion and future work" section summarizes the conclusions and gives some future directions.

Overview (FBI and SMA)
This section provides a detailed description of the FBI and SMA.

Forensic-based investigation (FBI)
The forensic-based investigation algorithm was inspired by Chou et al. from the investigation-localization-pursuit process of pursuing suspects by police officers involved in criminal investigations.It consists of two phases: the investigation phase (Step A) and the pursuit phase (Step B).The investigation phase is responsible for determining the location interval of the suspect in a general direction, while the pursuit phase requires a detailed search at the suspect location.The search space of the algorithm is defined as all possible suspect locations with the probability of locating the suspect as a metaphor for the objective function.In the search space, the investigator analyzes and evaluates the collected information to determine the identity and location of suspects, based on which the police can carry out an arrest.During this process, investigators and pursuers shift the direction of the search based on the latest evidence, thus requiring them to coordinate closely with each other throughout the process.The pseudo-code of the FBI appears in Algorithm1.The flowchart for the FBI is pictured in Fig. 1.
The original FBI can be categorized into several imperative steps: Step A1 Interpretation of Discovered Information.During this step, the investigative team analyzes the collected findings and initially identifies possible suspect locations, which can be inferred based on information related to X A i and other suspect points.In this work, each individual is influenced by the others.The new sus- pected location X A1 i is represented in Eq. (1).
Step A2 Determine the direction of the investigation.To establish the most likely suspect site, the investigator compares the probability of each suspect location to each other.p A i indicates the likelihood (objective value) that the suspect is located at position X A i , i.e., p A i denotes the objective value for the location X A i (i.e., p A i = fobjec- tive(X A i )).The investigator evaluates the likelihood of the new suspect's position and compares it to the location (1) www.nature.com/scientificreports/ of the current entrance.The site with the higher probability (objective value) of the suspect's presence will be reserved, while the other location will be discarded.The probability of each location is calculated by Eq. ( 2).
where p worst is the minimum probability of the existence of a suspect, p best is the maximum probability, and X best is the optimal location.The update of the search position will be affected by other suspicious positions, the direction of random selection is introduced on the basis of the optimal individual X best , to increase the diversity of the search area and expand the search space.The position update formula is shown below: where X best indicates the best position obtained in step A1, r 2 is a random value between 0 and 1; and d, e, f are three random values indicating the three suspicious positions.
Step B1 Begin the operation.In this stage, the arresting officer approaches the target location and arrests the suspect based on the best location provided by the investigation team.Each B i (pursuing officer) approaches the location with the best likelihood and updates the location if the newly approached location yields a better likelihood than the likelihood of the old location.
where r 3 and r 4 express two random values in the range of 0 and 1. j = 1, 2, . . ., D.
Step B2 Real-time position updates based on actions.While the tracking team is taking pursuit actions, it reports new suspect points to the headquarters in real-time.The headquarters will update the position and direct the tracking team to approach the suspect point.Each tracking team member works closely and interacts with each other.Agent B i approaches the target point and receives influence from team member B r at the same time.When the likelihood of B r is greater than the likelihood of B i , a new suspect location is generated according to Eq. (5); and vice versa according to Eq. ( 6).

Slime mould algorithm (SMA)
Many newly developed algorithms based on their respective properties and search mechanisms can balance exploitation and exploration well.SMA is an effective population-based algorithm proposed by Li et al. 28 .SMA principally emulates the behavioral and morphological changes of slime moulds when they feed.The algorithm uses weights to model the positive and negative feedback slime moulds generate during the foraging process, resulting in three different morphological types.Previous studies in many application scenarios [53][54][55][56] have demonstrated the superior performance of SMA in exploration and exploitation.

Approaching food
Slime moulds can approach food through odors in the environment.The following equation is used to model this contraction pattern of approaching food: where vb is a parameter between −a and a , and vc exhibits a linear decline from one to zero.T represents the contemporary iterations, and X b indicates the position with utmost concentration of odor discovered thus far.X refers the location of the slime moulds.X A and X B signify two fortuitously chosen individuals from the slime moulds.W symbolizes the weights assigned to the slime moulds.p is parameterized as follows: where i ∈ 1, 2, . . ., n, S(i) signifies the fitness evaluation of X , while DF indicates the most superior fitness achieved across all iterations.vb is given by the following equation: W relies on the utilization of the following formula: where condition refers to when S(i) is ranked within the superior half of the population, r signifies a randomly selected value between 0 and 1. max_t represents the maximum iterations, and bF and wF refer the best fitness and worst fitness, respectively, achieved during the current iteration.smellIndexindicates the sequence of fitness values for the ranking (which goes up in the minima problem).

Wrapping food
This section models the shrinking pattern of the venous tissue structure as the slime mould searches for food.
As the concentration of food that the vein is subjected to increases, the intensity of the wave produced by the biological oscillator amplifies, the speed of cytoplasmic flow accelerates, and the thickness of the vein augments.Equation ( 13) provides a mathematical representation of the positive and negative feedback relationship between the width of the veins of the slime mould and the food concentration, where the parameter r models the uncertainty in the contraction pattern of the veins.Including a logarithmic function slows the change in frequency so that no drastic changes occur in contraction frequency values.conditionemulates the fact that the slime moulds dynamically tune their search pattern due to the concentration of food.In conditions of elevated food concentration, the weight of the neighborhood increases; vice versa, the weight of the vicinity decreases, prompting slime moulds to venture into alternative regions for exploration.
The trends of vb and vc.
where lb and ub denote the upper and lower boundary within a search range.rand and r represent random vari- ables encompassing values inclusively between 0 and 1. rand is used as a key parameter to control whether or not to enter the stochastic update, and r determines whether or not to entry into the exploration and exploita- tion phase.Additionally, to adhere to the original text, the parameter z is specifically assigned the value of 0.03.

Oscillation
Slime moulds rely heavily on propagating waves produced by a biological oscillator to manipulate the movement of cytoplasmic flow within the vein to a location that favors food concentration.To emulate the alterations in the pulse width of slime moulds, W, vb, vc were used.W mathematically modeled the rate of oscillations of slime moulds relative to varying food concentrations.Consequently, this facilitated the slime moulds' ability to approach regions of higher food quality swiftly.In conditions of the food concentration was lower at certain locations, the slime moulds approached the food more slowly, thus improving the capability of the slime moulds to select the best food source efficiently.
The variable vb exhibits random oscillations between [−a, a] , but steadily converges to 0 as the number of itera- tions increases.Similarly, the variable vc undergoes oscillations within the interval of [−1, 1] , and eventually converges to 0. The visual representation of this behavior can be observed in Fig. 2. The synergy between vb and vc simulates the selection patterns demonstrated by slime moulds.The slime moulds will explore some areas independently to find better food sources.The slime mould will branch out to search for better food sources instead of concentrating on one food source.This strategy ensures that the slimy bacteria algorithm will not easily fall into a local optimum.

Proposed SMA_FBI
In this section, we provide a detailed description of the improved FBI and the time complexity of the algorithm in conjunction with the previous chapter.

Enhanced FBI with the SMA (SMA_FBI)
Except for the population size and the evaluation of stopping conditions, the base FBI does not rely on predefined parameters, so the parameters do not affect the algorithm's behavior.At the beginning of the FBI algorithm, www.nature.com/scientificreports/ the population was replicated into two parts, and each part independently searched for the optimal solution, interacting information through the current optimal position and the best fitness value.However, this led to barriers to communication within the algorithm, and the two parts did not exchange other information.Kaveh et al. 47 improved FBI by enhancing the exchange of information between the two parts.Instead of attempting to enhance communication, we added an inspector group to the investigating and searching groups, in which we introduced the search mechanism from the SMA.When searching for food, the slime moulds can adaptively tune their search pattern due to varying levels of food concentration.In conditions of elevated food concentration, the slime moulds concentrate their search on the currently recognized food sources; if they find a low food concentration, the slime moulds depart from the food source and search for other food concentrations.Throughout this process, the front end of the slime mould extends out and is able to build a network of veins in the search space, and the quality of the food source affects the propensity of the slime mould to search.
Using this property of slime moulds in FBI, an inspector group can be reconstructed in addition to the original investigation group and pursuit group, which can make up for the deficiencies of the first two stages and enhance the capacity of the FBI.The flowchart of SMA_FBI is displayed in Fig. 3

Computational complexity analysis
The time complexity of SMA_FBI is primarily associated with the dimension ( D ), the number of suspect locations ( NP ), and the number of evaluations ( GEN ).Overall, the time complexity is calculated from 4 aspects: initialization, fitness assessment, location update, and slime mould update strategy.For the initialization of the suspect location, the time complexity is O(D * NP) , the time cost of the adaptation assessment is O(NP) , the location updating part includes the investigation phase as well as the pursuit phase, and the time complexity of each phase is 2 * O(NP * D) , and the time consumption of the mucus updating strategy is O NP * 1 + logN + D .Considering the total number of evaluations GEN , the total time complexity of

Experiments
In order to assess the efficacy of the proposed algorithm SMA_FBI, a substantial quantity of experiments is undertaken in this section.Firstly, SMA_FBI is compared with 10 other original and 10 improved algorithms on 30 benchmark functions in CEC2017 57 .Benchmark datasets serve as widely acknowledged instruments for assessing the performance of various technologies against uniform criteria 58,59 .These datasets facilitate the evaluation of different technological dimensions, determining which technology excels over others across multiple domains 60,61 .Secondly, the algorithms are tested on different number of evaluations as well as different population sizes while controlling for other variables, and the complexity of SMA_FBI is also investigated and explained.
Finally, the effectiveness of SMA_FBI in practical applications is tested on the GS dataset.
To emphasize the impartiality of the experiments, all comparison algorithms undergo testing within the same hardware environment.Within the continuous optimization experiments, metaheuristic algorithm parameters are configured with a population size set at 30, and a maximum of 300,000 assessments.At the same time, to mitigate the influence of randomness on the experiments, all the algorithms are repeated on the test function for 30 times.Based on the experimental data, the capability of the comparison algorithms was evaluated using the mean ( avg. ) and standard deviation ( std.) of the optimal function values.The best results in the data are shown in bold.The nonparametric statistical test Wilcoxon signed-rank test 62 was utilized to ascertain whether SMA_FBI exhibits statistical superiority over other algorithms, with a significance level set at 0.05.The symbols " + / = /−" denote the proposed algorithm's superiority, equality, or inferiority to the other algorithms.Consistency was assessed using the Friedman test 63 to rank the mean experimental results and list the average ranked value (ARV).Accurate validation of any proposed model or algorithm must be done based on known parameters and settings.In the next experiments, all the parameter settings of the compared algorithms will be listed separately.
The experimental results of SMA_FBI in terms of function optimization are shown and analyzed in this subsection.Thirty functions of CEC2017 are selected as test functions, and the specifics of these functions can be found in Appendix A.1.Within this selection, F1-F3 represent unimodal functions, F4-F10 are associated with multimodal functions, F11-F20 pertain to hybrid functions, and F21-F30 are linked to composite functions.The unimodal function contains a single global optimal solution, which serves as a means to assess the algorithm's exploitation capabilities.Meanwhile, the multimodal function has multiple locally optimal solutions and is employed to evaluate the algorithm's capacity for global exploration.The hybrid and composite functions gauge the algorithm's equilibrium between exploitation and exploration.
All assessments were conducted on a Windows Server 2012 R2 datacenter operating system equipped with 128 GB of memory, utilizing an Intel (R) Xeon (R) E5-2650 v4 (2.20 GHz) CPU, within a MATLAB R2014b programming environment.
In "Parameter sensitivity analysis" section, experiments are analyzed for different evaluation numbers as well as population size."Comparison with conventional algorithms" and "Comparison with state of the art algorithms" sections entail comparisons of SMA_FBI with 10 original algorithms and 10 enhanced algorithms, respectively, aimed at substantiating SMA_FBI's performance in addressing exploration and exploitation in the context of CEC 2017.Furthermore, in "Experiments on real world optimization of GS" section, SMA_FBI is used to handle the GS problem for a dataset from the UCI database.

Parameter sensitivity analysis
In order to enhance the analysis of the algorithm's parameter sensitivity, the impact of population size and the number of evaluations on the algorithm is examined by manipulating individual parameters while holding other variables constant.
During this phase of the experiment, in order to reflect the comprehensiveness of the experiment, four different functions, namely, unimodal function F3, multimodal function F7, hybrid function F14, and composite function F29, were selected for verification.The population sizes were set to 10, 30, 60, 100 and 200 to study the influence of population size on the algorithm's performance.Based on the findings presented in Appendix A.2, the optimization effectiveness of SMA_FBI generally surpasses that of FBI.Moreover, when the population size is 30, the optimization effect of SMA_FBI reaches the optimal value, while the optimization effect of SMA_FBI is relatively poor when the population size deviates from 30.
Another pivotal factor influencing the experimental results is the number of algorithm evaluations.We selected five evaluation times, 50,000, 100,000, 150,000, 200,000, and 300,000, to investigate the effect of evaluation times on the property of SMA_FBI.Similarly, the validation is carried out in four functions: unimodal function F3, multimodal function F7, hybrid function F14, and composite function F29.From Appendix A.3, it can be seen that SMA_FBI has achieved the optimal value on the composite function before 50,000 evaluations.For the multimodal and hybrid functions, the optimal value is already close to the optimal value at 200,000 evaluations, but only at 300,000 evaluations can all the functions take the optimal value.In conclusion, we chose 300,000 evaluations.

Comparison with state-of-the-art algorithms
Within this experiment phase, the same CEC2017 benchmark function test set has been chosen to evaluate the capability of SMA_FBI in correlation to 10 state-of-the-art algorithms, namely EPSDE 68 , ALCPSO 69 , BMWOA 70 , CLPSO 71 , IGWO 72 , CESCA 73 , RDWOA 74 , LSHADE 75 , CBA 76 , and DECLS 77 .These 10 algorithms contain improved versions of various algorithms, especially of DE, PSO.EPSDE and LSHADE are two champion algorithms that have performed well in the field of evolutionary algorithms, and the superior performance of the proposed algorithm can be verified by comparing it with ten algorithms including these two.Table 2 shows the detailed parameter settings of the algorithms mentioned above.Appendix A.6 shows the comparison outcomes between SMA_FBI and the above advanced algorithms.
As can be noted from the summed rankings in Appendix A.6, SMA_FBI is still number one, even in the face of competition from the most highly acclaimed algorithms.In some of the previous functions of CEC2017, SMA_FBI did not achieve the best result compared to the champion algorithm, but it took the better solution.Furthermore, SMA_FBI achieves the optimal solution on most of the composite functions, i.e., F23, F25-30, and the sub-optimal solution on F24, and the std is 0 on all these functions, which indicates that SMA_FBI is more stable as well as robust on the composite functions.It shows that introducing the search mechanism of the SMA makes the algorithm more balanced between exploitation and exploration.The results of the Wilcoxon signed-rank test, comparing SMA_FBI with other state-of-the-art algorithms, are depicted in Appendix A.7. From the table in Appendix, it can be observed that in the experiments of SMA_FBI with BMWOA, IGWO, CESCA, CBA, the p-value is much less than 0.05, which proves that SMA_FBI outperforms these algorithms.Meanwhile, compared with other algorithms, most are also less than 0.05, which shows that SMA_FBI has apparent advantages over them.
As shown in Fig. 5, SMA_FBI also shows competitive performance compared to the state-of-the-art and improved algorithms, proving that SMA_FBI is more competitive.The comparison algorithms also contain www.nature.com/scientificreports/some improvements of DE and PSO algorithms, which further verifies that the introduction of the slime mould mechanism serves as a significant enhancement to the FBI.In summary, in the face of competition from the challenging state-of-the-art algorithms, the optimization ability of SMA_FBI is reflected in the overall optimization performance in different types of functions, especially in composite and hybrid functions.The slime mould search mechanism, as the third search scheme in the improved algorithm, enhances the algorithm's exploration and search capability as a whole.

Experiments on real-world optimization of GS
In this section, we employ the proposed algorithm SMA_FBI to address the GS problem and showcase the improved algorithm's effectiveness.Whereas the GS problem is a binary optimization task, we adapt the continuous SMA_FBI into a discrete variant, i.e., BSMA_FBI, to solve the high-dimensional GS problem.

Basic information
The GS problem requires selecting a set of most representative subsets from a collection of features for the purpose of dimensionality reduction of a dataset.GS can effectively reduce the computational cost of data, so many domains with large datasets wish to downsize application data.
In SMA_FBI based GS algorithm, x = x i,1 , x i,2 , . . ., x i,n represents a set of features, if x i,1 = 1 , it implies that the i th feature is selected; otherwise, the feature is not selected.GS represents a discrete optimization problem; therefore, converting the SMA_FBI algorithm to a binary version is necessary.We utilize a transfer function to convert continuous SMA_FBI to binary SMA_FBI(BSMA_FBI).The machine learning algorithm is employed in a classification capacity, and its classification accuracy is utilized to evaluate the ability of BSMA_FBI to screen important features in the dataset.In addition, during the evaluation process, cross-validation was employed to assess the optimum subset of features used for classification to avoid the impact of random elements on the experiment.

Fitness function and implementation of experiments
In previous work on continuous optimization, the proposed SMA_FBI searches for optimal solutions in a continuous search space.Whereas the GS problem is a binary problem, such a problem requires that the solution must be binary, i.e., it can only take either 0 or 1.However, many optimization algorithms are inherently designed for continuous spaces.Therefore, we need a way to convert the outputs of these continuous optimizers to binary values to satisfy the requirements of the problem.The transfer function (or threshold function) is the key to this conversion.The basic idea is to set a threshold value for the output of the continuous optimizer, and then convert the output to 0 or 1 according to this threshold value, with 1 indicating selected and 0 indicating unselected.By adjusting the threshold, we can control the stringency of the selected genes.A higher threshold will result in fewer genes being selected, while a lower threshold may result in more genes being selected.Here we choose a threshold of 0.5 as shown below: X j i is the i th value of the current search entity in the j th dimension within the discrete space.The transfer function is a proper translator that converts a continuous optimization algorithm into a discrete variant of the algorithm without altering the structure of the algorithm, which is convenient and efficient.Within this paper, V -type transfer function is employed, and its transfer function is shown below: GS is a process of obtaining the lowest classification error rate employing the least subset of features, which needs to be achieved simultaneously.Evidently, the GS problem presents itself as a multi-objective optimization challenge, and to satisfy each objective, a fitness function can be designed using the classification error rate and the number of selected features to evaluate the chosen feature subset.The specific form of the fitness function is shown below: where error represents the classification error rate computed by the K-Nearest Neighbor (KNN) 78 classifier, l signifies the size of the selected feature subset, and d is the total number of features in the dataset.Meanwhile, a and b serve as two weighting factors indicating the significance of the classification error and the subset length, respectively, to the GS problem.Our study asserts that the classification error rate deserves more attention than the feature subset length.Thus, we assign a to be 0.95 and b to be 1 − a , i.e., 0.05.Each feature subset is evaluated based on fitness, with smaller fitness values indicating superior feature subsets.

Experimental results and analysis of FS
The SMA_FBI based GS method, which we refer to as BSMA_FBI, will be in the face of competition from several state-of-the-art GS methods on 18 datasets, including bGWO 79 , BBA 80 , BGSA 81 , BPSO 82 , bALO 83 , BSSA 84 , bHHO 85 , and the original GS algorithm for the FBI, BFBI.These algorithms used for comparison are more classical algorithms and are commonly used in comparison experiments.They include many different kinds of ( 14) www.nature.com/scientificreports/algorithms such as nature-inspired algorithms, algorithms inspired by physical phenomena, and so on.Table 3 lists the detailed parameters of these classifiers.GS based on the SMA_FBI algorithm is performed on each dataset and is run N times, and tenfold cross-validations are performed each time.The data samples are partitioned into training, validation, and test sets in the cross-validation procedure according to a certain ratio.In this paper, the KNN classifier is used for classification.The classifier initially undergoes training and classification on all the data within the training set, subsequently assessing and validating against the samples in the validation set, and ultimately applies the chosen features to the test data to ascertain computational accuracy.Table 4 lists 18 detailed features from the UCI dataset, including the number of instances, features, and categories.As can be observed from the table, these datasets have 32-6598 samples, 23-15,010 features, and 2-26 classes.These datasets essentially represent different types of data, containing both small high-dimensional samples and large low-dimensional samples, which challenges the performance of the algorithm.
Appendix A.8-Appendix A.11 reflects the statistical findings of the means by the number of features selected, error rates, fitness values, and computation time.The bolded values represent the most favorable outcomes for the present comparison results.Examination of Appendix A.8 distinctly illustrates that the proposed BSMA_FBI selects the fewest features across nearly all datasets and achieves the second least number of features on the Parkinson, Lungcancer_3class dataset.By comparing the data of BSMA_FBI with BFBI, we can also find that our improvement of FBI is very effective, and our proposed algorithm selects fewer features and fetches better results than the original algorithm.The ARV metric shows the ranking results of various algorithms on multiple datasets, and there is no doubt that BSMA_FBI is ranked first.This shows that BSMA_FBI is competitive in selecting the least features.
According to the ARV, comparison results in Appendix A.9, as a whole, BSMA_FBI has not achieved the optimal results, but it has also achieved a suboptimal ranking, with an average error that is only a little bit higher than that of bGWO, and the average error value of BSMA_FBI is noteworthy lower when compared with that of BFBI.The proposed algorithm achieves the least average error on more than half of the datasets and has the smallest standard deviation, even though many of them are 0.This indicates that the proposed algorithm is very stable, which also proves the algorithm's superior behavior.Of course, we can also see that BSMA_FBI achieves relatively poor results on some algorithms, especially Tumors_9, Tumors_11, and Tumors_14, and we speculate that it may be the fact that these three datasets contain too many categories, which leads to the algorithm's general effect.
Appendix A.10 demonstrates the fitness values for the algorithm comparison, i.e., the weighted results of the error rate versus the number of features, from which it is evident that the data is mainly in line with the trend in Appendix A.9, although BSMA_FBI achieves the best results due to the addition of the number of features as a factor.The ARV results show that BSMA_FBI outperforms bGWO, and both significantly outperform the other optimizers, with BFBI having the worst results.This proves that the entry of the slime mould algorithm as a mechanism that improves the effectiveness of the original algorithm in searching for suitable features in the feature space has a positive impact.Based on the average computation time results in Appendix A.11, it can be observed that although the BSMA_FBI algorithm has a high computation time, it is still superior to BFBI, which can also prove the value of the improvement side by side, reducing the time cost.
Tables 5 and 6 show the Wilcoxon signed-rank test results of BSMA_FBI against other GS optimizers in terms of classification error as well as the number of features selected, respectively.From Table 5, it can be seen that there seems to be no significant difference between BSMA_FBI and other GS optimizers in terms of classification error and only in a few tests is the p-value less than 0.05.However, it can be seen from Table 6 that there is a significant difference between BSMA_FBI and other gene selection optimizers in terms of the number of features selected.This indicates that SMA_FBI has a significant advantage over these algorithms.
The images are more intuitive and visual than the data in the tables.Figures 6 and 7 show the optimal fitness values calculated by multiple algorithms during the optimization process in the form of curves.The horizontal

Discussion
This section summarizes the experimental findings of the proposed SMA_FBI on continuous function optimization and GS problems and provides a detailed analysis of the nature of the algorithms involved as well as the experimental results.The experimental part in "Experiments" section can be divided into three aspects: (1) Comparative experiments on the population size and on the number of algorithm evaluations in function optimization to find the most suitable population size and the number of evaluations; (2) On the CEC 2017 dataset, the correctness of the introduction of the slime mould search mechanism for the FBI is verified by comparing it to the base algorithm and state-of-the-art algorithms as well as the overall SMA_FBI algorithm's superiority.
(3) SMA_FBI is applied to a high-dimensional GS optimization problem from UCI data to prove the algorithm's behavior for effective dimensionality reduction of high-dimensional data and addressing discrete combinatorial optimization challenges.
From the perspective of function optimization, it can be seen from Appendix A.4, Appendix A.5 as well as Fig. 4, that the algorithm after the integration of the slime mould mechanism is superior compared to the FBI, and its optimization ability is stronger.Secondly, the SMA_FBI algorithm can occupy an obvious advantage, no matter it is compared with the classical DE, PSO, or the novel SMA.In addition, according to the data in Appendix A.6, Appendix A.7 and the curves in Fig. 5, we can see the comparison between SMA_FBI and a variety of improved algorithms, which contain several champion algorithms (EPSDE, LSHADE), as well as other stateof-the-art and improved algorithms (e.g., ALCPSO, DECLS), and so on.We can see that SMA_FBI significantly outperforms these state-of-the-art algorithms.At the same time, we can also see that SMA_FBI is not optimal in some problems, especially in the hybrid function, which is not obvious when comparing with the basic algorithms, but when comparing with the state-of-the-art algorithms, this problem becomes evidently apparent.In discrete combinatorial optimization, SMA_FBI achieves satisfactory results for GS problems.We evaluated BSMA_FBI (the binary version of the algorithm) as well as several GS optimizers using 18 datasets from the UCI repository (containing different types of data).Appendix A.8-Appendix A.11 quantitatively analyzes the performance of the algorithms in the four aspects of the number of selected features, classification errors, fitness values, and time cost, respectively.It is readily apparent that BSMA_FBI surpasses the other optimization techniques, and the proposed algorithm maintains high classification accuracy while selecting fewer features.It can also be seen that BFBI and BSMA_FBI algorithms are ranked in the bottom two positions in terms of time cost, while bGWO is also effective and has a higher time complexity.Given the significant reduction in time cost compared to the original algorithm, it is justifiable to take satisfaction in the performance of BSMA_FBI.In addition, we can see that BSMA_FBI is more effective on high-dimensional small-sample data but less effective on low-dimensional large-sample datasets and multi-classified data, which is the direction of our future improvement.In addition, Figs. 6 and 7 show that BSMA_FBI exhibits elevated classification accuracy and convergence at a superior rate compared to its counterparts.Thus, it shows that BSMA_FBI is a promising approach for discrete combinatorial optimization challenges in GS.
In brief, this article discusses the SMA_FBI algorithm, which incorporates a slime mould search mechanism based on the original FBI to achieve improved algorithm performance.By comparing with other excellent algorithms on the function optimization problem, it is found that SMA_FBI has a significant advantage in enhancing population diversity as well as convergence.In addition, compared with other GS methods on the GS problem, it is verified that BSMA_FBI can obtain higher classification accuracy while selecting fewer features.Of course, there is also a problem that BSMA_FBI has high time loss when performing GS, which is an optimization direction we need to consider afterward.Overall, SMA_FBI shows good prospects in addressing diverse optimization and GS problems.
Improving the accuracy and efficiency of gene selection plays a crucial role in medical diagnosis and personalized therapy, and has a profound impact on the development of drug discovery and individualized treatment.By improving the accuracy of gene selection, we are able to more accurately identify genetic variants associated with diseases and thus diagnose them more accurately.This helps to avoid misdiagnosis and underdiagnosis, and provide patients with more precise and personalized treatment plans.Meanwhile, in the process of drug development, by accurately selecting relevant genes, we can study the mechanism of action of drugs in greater depth, accelerate the process of drug development, reduce the cost of research and development, and improve the efficiency of research and development, so as to assist in medical diagnosis and personalized treatment.

Figure 5 .
Figure 5. Convergence curves of SMA_FBI and ten state-of-the-art algorithms on twelve functions.

Figure 6 .
Figure 6.Convergence plots for the BSMA_FBI and alternative binary metaheuristic algorithms across 9 datasets.

Figure 7 .
Figure 7. Convergence plots for the BSMA_FBI and alternative binary metaheuristic algorithms across 9 datasets.

Table 1 .
The specific configuration of parameters.

Table 2 .
The detailed parameter settings.

Table 3 .
Parameter settings for the classifiers.

Table 4 .
Characteristics of gene expression datasets.

Table 5 .
The p-value of the Wilcoxon test between the BSMA_FBI and alternative GS optimizers on average error rate.

Table 6 .
The p-value of Wilcoxon test between the BSMA_FBI and alternative GS optimizers on average number of the selected features.